AD-A210  829 


LABORATORY  FOR 
COMPUTER  SCIENCE 


.  MASSACHUSETTS 
'  INSTITUTE  OF 
TECHNOLOGY 


MIT/LCS/TM-SSS 


A  SPECIAL  CASE  OF 
SECOND-ORDER  STRICTNESS 

ANALYSIS 

DT1C 

^AUG  0  219m|^ 

Paul  P.  Wang 


STATEMENT  aT* 

Approved  (o:  publir  ^ 

Pistnoutioo  UcJliartod  ^ 

February  1989 


545  TECHNOLOGY  SQUARE.  CAMBRIDGE.  MASSACHUSETTS  02139 


89 


SECURITY  CLASSIFICATION  OF  THIS  PAGE 


la  REPORT  SECURITY  CLASSIFICATION 

Unclassified 


2a  SECURITY  CLASSIFICATION  AUTHORITY 


REPORT  DOCUMENTATION  PAGE 


lb  RESTRICTIVE  MARKINGS 


2b  DECLASSIFICATION /DOWNGRADING  SCHEDULE 


4.  PERFORMING  ORGANIZATION  REPORT  NUMBER(S) 

MIT/LCS/TM-383 


6a  NAME  OF  PERFORMING  ORGANIZATION 

MIT  Laboratory  for  Computer 
Science 

6b  OFFICE  SYMBOL 
(If  applicable) 

6c.  ADDRESS  (Ofy,  State,  and  ZIP  Code) 

545  Technology  Square 
Cambridge,  MA  02139 

8a.  NAME  OF  FUNDING /SPONSORING 
ORGANIZATION 

DARPA/DOD 

8b  OFFICE  SYMBOL 
(If  applicable) 

8c.  ADDRESS  fOty,  State,  and  ZIP  Code) 

1400  Wilson  Blvd. 

Arlington,  VA  22217 

3  DISTRIBUTION /AVAILABILITY  OF  REPORT 

Approved  for  public  release;  distribution 
is  unlimited. 


S  MONITORING  ORGANIZATION  REPORT  NUMBER(S) 

N00014-83-K-0125 


Office  of  Kaval  Research/Department  of  Navy 


7b  ADDRESS  (City,  State,  »nd  ZIP  Coch) 

Information  Systems  Program 
Arlington,  VA  22217 


1  10  SOURCE  OF  FUNDING  NUMBERS 

PROGRAM 
ELEMENT  NO 

PROJECT 

NO. 

TASK 

NO 

WORK  UNIT 
ACCESSION  NO. 

11.  TITLE  (Include  Security  Classification) 

A  Special  Case  of  Second-Order  Strictness  Analysis 


12.  PERSONAL  AUTHOR(S) 

Wang,  Paul  P. 


13a.  TYPE  OF  REPORT 

Technical 


13b  TIME  COVERED 
FROM  TO 


14.  DATE  OF  REPORT  (Year,  Month,  Day)  IS  PAGE  COUNT 
1989  February  19 


COSATI  CODES 


SUB-GROUP 


18  SUBJECT  TERMS  (Continue  on  reverse  if  necessary  and  identify  by  block  number) 

strictness  analysis,  abstract  interpretation,  termination 
properties 


19.  ABSTRACT  (Continue  on  reverse  if  necessary  and  identify  by  block  number) 

A  function  declaration  is  called  strict  in  one  of  its  formal  parameters  if,  in  all 
calls  to  the  function,  either  the  actual  corresponding  parameter  is  evaluated,  or  the  call 
does  not  terminate.  Approximating  strictness  analysis  with  abstract  interpretation  reduces 
to  the  evaluation  of  recursive  monotone  Boolean  functions.  This  evaluation  problem  is 
complete  in  deterministic  exponential  time  w'hen  the  functions  are  declared  with  only  base- 
type  formal  parameters,  and  is  deterministic  super-exponential  time  hard  when  functions 
are  formal  parameters.  Hov/ever,  by  coarsening  the  strictness  analysis  approximation  to 
be  conjunctive,  the  first-order  case  can  be  completed  in  linear  time.  This  paper  will 
show  that  the  second-order  case  is  NP-hard.  . 


20  DISTRIBUTION /AVAILABILITY  OF  ABSTRACT  21  ABSTRACT  SECURITY  CLASSIFICATION 

□  unclassified/unlimited  □  same  AS  rpt  □  dtic  USERS  Unclassified  _ 


22a  NAME  OF  RESPONSIBLE  INDIVIDUAL  22b  TELEPHONE  f/nc/uefe  Area  Code)  22c  OFFICE  SYMBOL 

Judv  Little,  Publications  Coordinator  (617)  253-5894  _ 


DO  FORM  1473,  84  MAR  83  APR  edition  may  be  userf  until  exhausted  SECURITY  CLASSIFICATION  OF  THIS  PAGE 

All  Other  editions  are  obsolete  ”  ‘  ~  ~ 

AU.B.  OowMMnt  Otftait  B07 

Unclassified 


A  Special  Case  of  Second-Order  Strictness 

Analysis* 


Accestof'  for 


Paul  P.  Wang^ 

MIT  Laboratory  for  Computer  Science 
February  15,  1989 


Abstract 

A  function  declaration  is  called  strict  in  one  of  its  formal  parame¬ 
ters  if,  in  all  calls  to  the  function,  either  the  actual  corresponding  pa¬ 
rameter  is  evaluated,  or  the  call  does  not  terminate.  Approximating 
strictness  analysis  with  abstract  interpretation  reduces  to  the  evalua¬ 
tion  of  recursive  monotone  Boolean  functions.  This  evaluation  prob¬ 
lem  is  complete  in  deterministic  exponential  time  when  the  functions 
are  declared  with  only  base-type  formal  parameters,  and  is  determin¬ 
istic  super-exponential  time  hard  when  functions  are  formal  parame¬ 
ters.  However,  by  coarsening  the  strictness  analysis  approximation  to 
be  conjunctive,  the  first-order  case  can  be  completed  in  linear  time. 
This  paper  will  show  that  the  second-order  case  is  AP-hard. 


NTlS  CRA&I 
OTIC  TAB  □ 

Unannounced  □ 

Jjstification  _ _ 


By _ 

Distribution/ 


Availability  Codes 

Dist 

Avail  a 
Spe 

nd  f  Of 
cial 

‘This  memo  is  a  report  based  on  tlie  author’s  Senior  Thesis,  submitted  in  June,  1988 
to  the  Department  of  Electrical  Engineering  and  Computer  Science,  under  the  supervision 
of  Prof.  Albert  R.  Meyer. 

^This  research  weis  supported  in  part  by  a  NSF  Graduate  Fellowship,  NSF  Grant  98477- 
8511190-CCR,  and  DARPA  Grant  NOOO 1 4-83- K-0 125. 


1 


1  Introduction 


Call-by-necd  evaluation  strategics  are  used  in  the  semantics  of  many  mod¬ 
ern  programming  languages.  The  advantage  of  a  call-by-need  interpreter, 
as  opposed  to  one  using  call-by-value  strategies,  is  that  normal-order  eval¬ 
uation  (which  corresponds  to  cal  I -by- need)  will  terminate  in  many  cases 
where  applicative-order  evaluation  (call-by-value)  will  not.  However,  in  cases 
where  both  stratcgica  will  tciininate,  a  call-by- value  interpreter  is  generally 
regarded  as  being  more  efficient  due  to  the  overhead  associated  with  call-by¬ 
need’s  “delayed”  representation  of  objects. 

Strictness  analysis  is  a  way  of  reducing  such  overhead.  A  function  decla¬ 
ration  is  called  strict  in  one  of  its  formal  parameters  if,  in  all  calls  to  the  func¬ 
tion,  either  the  corresponding  actual  parameter  is  evaluated  or  the  call  does 
not  terminate  on  the  usual  call-by-need  .style  interpreter.  Strictness  analysis 
determines  the  termination  properties  of  the  function  being  analyzed.  The 
results  of  such  analysis  have  been  used  to  optimize  both  parallel  and  sequen¬ 
tial  implementations  of  computer  languages.  If  a  function  is  strict  in  all  its 
formal  parameters,  then  converting  a  call-by-need  interpreter  to  use  a  call- 
by-value  strategy  for  evaluating  the  function’s  arguments  will  not  change  the 
results  of  the  program. 

Recently,  several  papers  have  offered  a  method,  namely  abstract  interpre¬ 
tation,  to  infer  an  approximation  of  strictness  of  a  function.  More  specifically, 
all  program  results  can  be  abstracted  to  the  domain  {_L,  T},  where  ±  C  T.  _L 
is  interpreted  as  “the  program  does  not  terminate”.  T  is  interpreted  as  “the 
program  might  or  might  not  terminate”.  The  operators  □  and  U  compute 
the  minimum  and  maximum  of  two  arguments.  All  base-type  constants  are 
mapped  to  T  because  they  always  terrr  rt.,e.  Also,  strict  functions  such  as 
-I-  (integer  addition)  and  A  (Boolean  and  c^.rration)  are  mapped  to  Xxy.xHy 
since  they  diverge  if  either  argument  diverges.  Conditionals  are  mapped  to 
Xpca.p  n  (c  U  a).  See  [6,  9,  1]  for  details. 

The  above  domain  {±,  T}  with  the  operators  □  and  U  can  also  be  viewed 
as  the  Boolean  values  “true”  and  “false”  with  the  Boolean  operators  “and” 
and  “or”. 

Because  many  programs  that  are  analyzed  include  recursion,  this  approx¬ 
imation  of  strictness  analysis  reduces  to  the  evaluation  of  recursive  monotone 
Boolean  functions  (RMBF’s)  [6,  9,  1].  Furthermore,  since  many  program  sub¬ 
routines  are  allowetl  to  be  declared  simultaneously  and  thus  can  “recursively 
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call  each  other”,  such  an  approximation  of  strictness  analysis  must  reduce 
to  the  evaluation  of  schemes  representing  sets  of  simultaneously  declared 
RMBF’s.  This  paper  will  define  such  schemes  (called  recursive  monotone 
schemes  or  RMS’s)  as  well  as  evaluation  decidability  problems  for  RMS’s. 

Recent  work  has  shown  that  first-order  strictness  analysis  approximation 
using  abstract  interpretation  (i.e.  the  functions  to  be  analyzed  are  declared 
with  only  base-type  formal  parameters)  is  complete  in  deterministic  exponen¬ 
tial  time  [3,  6].  Furthermore,  higher-order  strictness  analysis  approximation 
(i.e.  the  functions  to  be  analyzed  are  declared  with  functions  as  formal  pa¬ 
rameters)  is  deterministic  “super-exponential”  time  hard  [3,  12]. 

There  have  been  many  attempts  to  reduce  the  average  time  efficiency  of 
strictness  analysis  algorithms  to  more  acceptable  levels.  One  way  to  do  this 
is  to  develop  algorithms  which  take  less  time  “in  the  average  case”  [5,  2). 
Another  way  is  to  modify  strictness  analysis  by  restricting  the  amount  of 
information  that  we  can  extract  through  abstract  interpretation  [11,  7].  This 
paper  discusses  the  complexity  of  strictness  analysis  when  the  analysis  is 
coarsened  to  treat  the  conditional  as  though  it  were  an  ordinary  algebraic 
function  known  only  to  be  strict  in  its  first  argument  (i.e.  conditionals  are 
mapped  to  Xpca.p).  We  refer  to  this  as  conjunctive  strictness  analysis. 

Conjunctive  strictness  analysis  reduces  to  the  evaluation  of  recursive 
monotone  schemes  whose  syntactic  representations  do  not  contain  any  occur¬ 
rences  of  “U”  symbols.  We  refer  to  these  as  recursive  conjunctive  monotone 
schemes  (RCMS’s).  Conjunctive  strictness  analysis  in  the  first-order  case 
turns  out  to  be  a  linear-time  problem  with  respect  to  the  length  of  the  an¬ 
alyzed  function’s  syntactic  definition  [7].  However,  in  the  second-order  case 
(i.e.  when  first-order  functions  are  formal  parameters),  conjunctive  strictness 
analysis  is  AiP-hard. 

This  paper  assumes  that  the  reader  is  familiar  with  the  elementary  defi¬ 
nitions  of  domain  theory  and  computation  theory. 


2  Formal  Definitions  and  Notation 

In  order  to  design  a  scheme  that  represents  a  set  of  simultaneously  declared 
recursive  monotone  Boolean  functions,  we  first  formalize  the  domain  of  ab¬ 
stract  interpretation  (Section  2.1).  We  next  introduce  “types”  aad  “terms” 
to  formalize  our  syntax  (Section  2.2).  After  defining  recursive  monotone 
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schemes  (RMS’s)  as  representing  sets  of  simultaneously  declared  RMBF’s 
(Section  2.3).  we  then  assign  meaning  to  these  schemes  (Section  2.4). 

2.1  The  Abstract  Interpretation  Domain 

Definition  2.1  L  ct  2  br  the  domain  {T,  ±}  where  ±  C  T.  For  partial  orders 
C,D,  let  C  D  denote  the  monotone  fwnction  space.  Let  C  x  D  denote  the 
cartesian  product  of  C  and  D.  We  will  use  the  standard  "‘pointwise"  partial 
orders  for  the  sets  C  x  D  and  C  ^  D.  Refer  to  [10]  for  details. 

We  will  denote  the  least  element  in  the  set  C  (if  it  exists)  by  Fq.  (Note 
that  ±2  =  -b-) 

2.2  Types  and  Terms 

In  order  to  define  our  recursive  monotone  schemes,  we  must  first  introduce 
the  concept  of  types.  An  object’s  type  determines  the  “functional  behavior” 
of  what  the  object  intuitively  represents. 

Definition  2.2  The  set  of  types  derived  over  the  base  type  h  can  be  defined 
by  the  following  BNF  grammar: 


T  ::=  b  \  {riist)  r 

Tt,st  T  \  TUst  X  T 

Objects  of  type  b  represent  the  values  {T,  ±},  the  symbol  x  represents 
cartesian  product,  and  the  symbol  — >  represents  monotone  function  space. 
We  define  depth  of  types  as: 

Definition  2.3  7he  depth  of  type  r  is  defined  as  follows: 

•  depth(b)  =  0. 

•  dcpth({Ti  X  T2  X  ...  X  r„)  r„+,)  =  1  +  maxi<,<„+i  [c/ept//(T.)] 

Let  “.r’’’"  denote  a  typed  variable  symbol  of  type  r.  Let  and  T*"  be 
constant  .symbols. 

We  will  now  introduce  the  concepts  of  preterms,  type-statements,  and 
terms. 
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Definition  2.4  Preterms,  e,  are  given  by  the  following  BNF  grammar: 


6 

::=  var  [  X**  |  T''  j  e  U  e  |  e  □  e  |  e(e(,-,t) 

Xvar/ist.e 

var 

::=  Xq  1  x[  1  Xj  1  •  •  •  (r  is  a  type) 

variist 

::=  var  \  varn^t  var 

^list 

..  6  I  Clist^  ^ 

Definition  2.5  Type-Statements  are  pairs  e  :  t ,  where  t  is  a  pre-term  and 
T  is  a  type,  defined  inductively  as  follows: 

•  X*’  :  6  and  T*"  :  b  are  type-statements. 

•  x’’  :  r  IS  a  type-statement. 

Cl  n  62  :  r 


•  ^2  :  r 

Cl  U  62  :  T 


^  e  :  (n  X  r2  X  ■ .  ■  X  Tn)  ->  r,  6,-  :  r,-  (1  <  i  <  n) 
e(ei,62,. . .  ,e„)  :  r 


_ e  :  T 

Wy?  . .  .y^^.e  :  {ri  X  r2  X  . . .  X  r„)  r 

Definition  2.6  e  is  a  term  iffe'.r  is  a  type-statement  for  some  r.  A  term 
is  conjunctive  iff  it  does  not  contain  any  occurrences  of  “U  ”. 

Note  that  r  is  unique  if  it  exists. 

Definition  2.7  The  depth  of  a  term,  e,  is  defined  inductively  as  follows: 

•  depth{±>’)  =  depth(T^)  =  0 

•  depth{x^)  =  depth{T) 
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•  depth(ei  □  62)  =  m&x{depth{ei),depth{e2)) 

•  depth{ei  U  €2)  —  rmix{depth(ei),deplh{e2)) 

•  depth{eo{eue2 . f,,))  =  niaXo<,<„(f^ep</?.(e, )) 

•  depth{\yl^yl^  .  .  .y,7‘.f )  =  niaxo<,<„(l  +  dtpth{T,),dcpth{c)), 
ivhere  e  :  Tq. 


2.3  Recursive  Monotone  Schemes 

Definition  2.8  A  recursive  monotone  scheme  (RMS),  S ,  is  an  n-tuple  S  = 
((J■[^el),(a■2^e2),•••,(‘^•^",en)).  u'hcre  e,  :  r,  (1  <  z  <  n)  and  FV{ei)  C 

An  RMS  S  =  ((xp ,  Cl),  €2), . . . ,  e„))  represents  the  simultane¬ 

ous  declaration  of  n  recursive  monotone  Boolean  functions.  The  set  of  typed 
symbols  represent  the  declared  functions,  and  each  term  e, 

represents  the  declaration  of  the  function  x]' . 

An  example  of  an  RMS  would  be  the  following: 

Example  2.1  Consider  the  following  declaration: 


Ictrrc  f{x,y)  =  x  V  g{h{x  A  y,y),  h) 
and  g{x,y)  =  y{x,x)V  x 
andh{x,y)  =  x  A  {y  \/  f{y,  x)) 

The  RMS  S  =  ((x[‘ ,  Cl),  (^2^,  62),  (.T3®,  63))  will  represent  the  above  dec¬ 
laration,  where: 


ei 


0-2 


f’3 

r\ 

T2 

T3 


■^VivUvi  u  n  y2> ^2).  ^?)] 

n  (2/2  LJ  ^f{y2->  J/i))] 

h^ 

{b  X  {b^  b))  ^  b 

b^  b  a 
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Definition  2.9  Let  the  depth  of  the  RMS  S  =  ((x[',  ei), . . . ,  fn))  be: 

depth{S)  =  max  {dept li{ei)) 

1  <i<n 

2.4  Interpretations 

The  purpose  of  types  is  to  denote  the  “functional  behavior”  of  what  each 
typed  object  is  representing.  Let  I  J  denote  a  “type  interpreter”  that  assigns 
a  set  of  functions  to  each  type  derivable  under  our  definitions. 

•  I^I  =  2. 

•  Let  r  =  (ti  X  Ta  X  . . .  X  r„)  — f  r„+i. 

It'I  =  (It’iI  X  |r2l  X  . . .  X  |r„|)  -4  |r„+i|. 

VVe  will  now  introduce  the  idea  of  a.ssigning  (semantic)  values  to  (syntac¬ 
tic)  terms.  Let  an  environment,  p,  represent  a  mapping  from  typed  variables 
x”  to  elements  in  |rj.  Then  for  any  term  e,  where  e  :  r,  we  can  assign  an 
element  in  |rj  to  e.  Let  \e\p  denote  that  element  and  be  inductively  defined 
as  follows: 

•  =  1  and  =  T. 

•  l^^\p  = 

•  Ie(ei,  62, ,  e„)]]/9  =  [ej/?  (Ie,|/),  |e2l/0, . . . ,  Ie„I/9). 

•  lei  n  cjI/j  =  Ieil/7  A  Ie2l/9. 

where  A  is  the  pointwise  gib  operator. 

•  fei  U  e2\p  =  Ieil/9  V  Ie2]|^. 

where  V  is  the  pointwise  lub  operator. 

•  IA(x['x2"  .  ..x^^).e\p  =  /  G  /I(ri  x  •  •  •  r„)  rj 

where  e  :  r  and  /(di ,  ^2,  •  •  • ,  <^n)  =  ^  (1  ^  ^  G 

l[r,I).  p[x]'  I-+  di\  denotes  the  environment  obtained  from  0  by  letting 
x['  be  mapped  to  d,. 
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Lei  RMS  N  =  . (j’lV'- ^ would  like  to  define  an  en¬ 
vironment  ps  that  will  assign  to  ('aeh  typed  symbol  ./•[’  (1  <  ?  <  the 

element  of  |[r,|.  such  that  pix]')  (1  <  i  <  »)  are  the  least  fixed  points  of  the 
following  set  of  equations: 


( 


1 


,1 


''2 

1 


( 


2 


There  are  many  algorithms  for  generating  ps  [13]. 

Thus  for  any  RMS  5  -  ((•?■!'.<  i) . {•''n'^f’n))  ancl  any  well-typed  term 

f.  such  that  r  :  r  and  F\'{c)  C  {./•[', J'p . a;,"},  we  can  assign  to  (  an 

element  in  |r|  by  computing  fejp.s.  This  is  the  value  of  the  term  e  when 
its  ‘'function  calls"  are  interpreted  as  within  the  scope  of  the  declarations 
represented  by  .S. 

2.5  Problems 

In  this  section,  we  will  formalize  some  interesting  decidability  problems  rel¬ 
evant  to  RMS's  and  strictness  analysis. 

Definition  2.10  A'tli-Order  RMS  Evaluation  Problem  (kE): 

5  G  kE  iff: 

•  —  ((•'■['•ti), - (j:'  ic  an  RMS  with  depth{S)  <  k  and  =  b. 

•  Psi^i)  =  T 

Theorem  2.1  The  IE  evaluation  problem  is  complete  in  deterministic  ex¬ 
ponential  time  [3.  6]. 

Theorem  2.2  In  general,  as  the  depth  of  the  RMS  increases,  the  complexity 
of  its  evaluation  decidability  problem  grows  as  a  composition  of  exponentials 
[3.  J2J. 
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3  Improving  the  Running  Time 

The  results  of  Tlieorcuns  2.1  and  2.2  .sugge.st  that  the  aetua!  implementation 
of  strictness  analysis  is  intractable  due  to  its  deterministic  exponential  (and 
in  higher-order  cases  “super-exponential")  time  behavior.  However,  there  are 
ways  of  improving  the  running  time.  One  approach  is  to  devise  algorithms 
whose  “average  case"  behavior  is  significantly  better  than  exponential  (or 
super-exponential)  time.  Examples  of  such  methods  are  Frontier  Analysis 
and  Pending  Analysis.  See  [5,  2]  for  details. 

Another  approach  is  to  improve  the  running  time  of  strictness  analysis  by 
restricting  the  amount  of  information  that  can  be  extracted  from  abstract  in¬ 
terpretation.  Abstract  interpretation  approximates  conditionals  by  mapping 
them  to  Apca.p  n  (cU  a).  However,  by  “coarsening”  abstract  interpretation 
to  treat  conditionals  as  though  they  were  ordinary  algebraic  functions  that 
were  known  to  be  strict  only  in  the  first  argument  (i.e.  map  conditionals  to 
Xpca.p),  we  get  a  “coarser”  approximation  of  strictness  analysis.  However, 
sucli  a  sacrifice  in  accuracy  could  yield  a  significant  improvement  in  running 
time.  VV^e  refer  to  this  version  of  strictness  analysis  as  conjunctive  strictness 
analysis. 

Conjunctive  strictness  analysis  reduces  to  the  evaluation  of  RMS's  whose 
syntactic  definition  does  not  contain  any  occurrences  of  the  “U”  symbol.  We 
refer  to  such  a  RMS  as  a  recursive  conjunctive  monotone  scheme  or  RCMS. 
We  define  the  k-ih  Order  RCMS  Evaluation  Problem  (kEC)  as  the  special 
case  of  the  kE  evaluation  problem  where  the  RMS  evaluated  is  also  an  RCMS. 

With  minor  corrections  and  revisions  on  the  work  by  [7],  it  can  be  shown 
that  the  lEC  decidability  problem  takes  deterministic  linear  time.  This  im¬ 
provement  in  running  time  (i.e.  from  deterministic  exponential  to  linear  time) 
is  not  as  impressive  as  it  may  seem,  since  the  “coarsened”  strictness  analy¬ 
sis  significantly  limits  the  amount  of  information  that  can  be  approximated. 
However,  it  would  be  interesting  to  see  if  the  2EC  decidability  problem  re¬ 
mains  tractable  (i.e.  remains  in  polynomial  time),  if  it  becomes  exponential 
in  complexity,  or  perhaps  stays  super  exponential.  The  next  section  will  show 
that  2EC  is,  in  fact,  AT’-hard. 
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4  Second- Order  Conjunctive  Strictness 

riie  ‘iEC’  (ItH  idal.iility  problem  is  AV"*  liard  because  there  exists  a  polynomial 
time  many  one  reduction  from  tlie  (ISAT  decidability  problem.  W’e  first 
formally  define  the  CSAT  problem  (Section  4.1).  Then  we  show  a  poly-time 
many-one  reduction  from  CIS.AT  to  2F.('  (Section  4.2). 

4.1  GSAT 

The  (iS.\d'  decidability  i)roblem  (the  satisfiability  ])roblem  for  propositional 
calculus)  is  a  commonly  known  A'P-complete  problem.  This  paper  will  as¬ 
sume  the  reader’s  knowledge  of  the  syntax  of  propositional  cal'^ulus  formulas 
as  well  as  the  concepts  of  Boolean  variables,  truth-assignments,  verifying  and 
satisfiability.  For  more  details,  please  refer  to  [4.  8]. 

Definition  4.1  CiS.AT  Problem: 

CSAT  =  {o{pi,p2. ...  ■,Pn)\4>  (I  pi’oposilioual  calculus  formula  irilh  Boolean 

variables  p^.pi . Pn  under  (he  operators  -',A,  and  V,  such  that  6  has  a 

satisfying  truth- assignment} 

W’e  will  define  a  truth-assignment.  A,  for  the  Boolean  variables  of  (p  as 
a  mapping  from  the  Boolean  variables  of  <l>  to  the  set  {1,0},  where  1  and  0 
denote  the  Boolean  values  true,  and  false,  respectively. 

The  GS.AT  decidability  problem  is  A’P-complete  [4,  8]. 

4.2  GSAT  and  2EC 

In  this  section,  we  show  a  poly-time  many-one  reduction  from  GS.AT  to  2Er. 
Before  we  do  this  we  need  to  present  the  following  Lemma; 

Lemma  4.1  For  any  n  hit  rector  a  =  (rt„_i,  o„_2, . . . .  oo),  Ft  hiii{a.)  be  the 
integer  k  (0  <  k  <  2”)  with  the  n-bit  representation  a.  One  can  construct  for 
uny  j  (0  <  j  <  n  —  1),  a  propositional  calculus  formula  d>"(«„_i ,  .  .  .  .  uq) 

such  that: 


biii{(pj{a))  =  {bin(a)  4-  1)  mod  2” 
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In  fact,  it  is  not  hard  to  see  that  (j)'J  need  only  have  0{n)  connectives  and 
can  be  constructed,  given  n  and  j,  in  time  polynomial  in  n. 

Example  4.1  The  following  formulas  are  defined  by: 

<j)Q  :;=  ^ao 

<^i  ::=  (->00  V -ifli)  A  (go  V  Gj) 

<^2  ••=  (~'<io  V  —>ai  V  ~'a2)  A  ((gq  A  Gi)  V  G2)  CD 


We  now  introduce  a  restricted  class  of  propositional  calculus  formulas, 
called  “negation-pushed”  formulas.  A  negation-pushed  formula  is  a  proposi¬ 
tional  calculus  formula,  where  all  occurrences  of  the  “-1”  symbol  immediately 
precede  a  Boolean  variable.  Note  that  every  propositional  calculus  formula 
can  be  transformed  in  linear  time  into  a  negation-pushed  formula  by  repeated 
use  of  DeMorgan’s  Law  and  the  identity  ->->p  =  p. 

Before  we  go  any  further,  we  should  make  the  following  comments  on 
notation  to  be  used  in  this  section: 

•  Let  I  denote  the  identity  function  of  the  domain  2. 

•  Let  T  denote  the  constant  function  T  from  2  to  2. 

We  now  define  /,  a  mapping  from  propositional  calculus  formulcis  to 
conjunctive  terms  of  type  b  —*  b.  The  difficulty  in  such  a  mapping  is  how  to 
“simulate”  the  Boolean  or  operator  in  conjunctive  terms,  since  such  terms 
cannot  contain  any  occurrences  of  the  “U”  symbol.  The  way  around  such 
a  difficulty  is  to  represent  Boolean  values  with  first-order  functions  and  the 
Boolean  or  operator  with  the  composition  of  first-order  functions.  The  main 
ideas  of  the  mapping  /  can  be  summarized  below: 

•  “  imulate”  the  Boolean  value  0  with  the  first-order  function  I. 

■■  mulate”  1  with  T. 

•  “s’’"'  date”  Boolean  and  with  □,  i.e.  pointwise  gib. 
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•  ‘‘simulate”  Boolean  or  with  the  composition  of  two  first-order  functions. 
The  basis  for  these  ideas  are  formalized  in  the  following  Lemma: 
Lemma  4.2  Let  a(l)  =  T,  a(0)  =  /  and  ai,a2  €  {1,0}.  Then: 


a(ai)no(fl2)  =  a(ai  A  02) 

0(01)00(02)  o(oi  V  02) 

where  A  and  V  represent  Boolean  and  and  or  respectively,  PI  represents  point- 
wise  gib,  and  o  represents  composition. 

With  the  above  Lemma  in  mind,  we  now  formally  define  the  mapping  /. 
Given  a  propositional  calculus  formula  (;6(p„_i,p„_2, . . .  ,po),  define  /(<^)  as 
follows: 


•  1.  Transform  cf)  to  its  negation-pushed  form  4>'. 

2.  Transform  (f)'  into  a  conjunctive  term  of  type  b  b  using  a  map¬ 
ping  g  defined  inductively  as  follows: 

-  9{Pi)  Vi'"'’ 

-  9hp>)  ■■=  2?^^ 

-  9{<f>i  A  <f>2)  ::=  9{<f>.)^9{<f>2) 

-  g{(f>i  V  (P2)  ::=  g{(f>i)  o  g{(f>2) 

where,  for  clarity  purposes,  the  above  is  an  abbreviation  for: 
9{<f>i  V  (f>2)  ■■■=  ^a:^[{p(<^i)}({p((^2)}(a^‘))] 


/  can  obviously  be  computed  in  0{\<i>\)  time. 


Example  4.2  Consider  the  following  propositional  calculus  formula  (f: 


<p  ::=  po  V  -’(pi  A  -'P2) 
Then  f{(f>)  is  defined  by: 

/(<>)  ::=  yf  o  zj-*  o  jJ-*  □ 
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Before  we  make  any  formal  conclusions  about  /,  we  must  introduce  the 
concept  of  a  well-formed  environment . 


Definition  4.2  A  well-formed  environment,  p,  is  an  environment  such  that, 
V  *'•  p{y^r^)  e  {TJ}  and: 


P{ 


/,  tfp{y^r'^)=T 

T’,  otherwise 


We  can  now  make  the  following  conclusion  about  /; 


Lemma  4.3  Let  . .  -  ,Po)  he  a  Boolean  formula  and  A  be  a  truth- 

assignment  for  the  variables  of  4).  Then: 


•  If  A  verifies  (j),  then  l_f{<f>)\pA  =  T 


•  If  A  does  not  verify  <f>,  then  |I/(<^)lp,4  =  / 


where  p^  is  the  well-formed  environment  defined  by: 


{ 


/, 


if  A{pi)  =  1 
otherwise 


Proof  of  Lemma  4.3:  Can  be  shown  by  induction  on  the  length  of  (f>  using 
Lemma  4.2.  □ 

With  the  above  Lemma,  we  can  now  show  a  poly-time  many-one  reduction 
from  GSAT  to  2EC. 

Theorem  4.1  GSAT  is  poly-time  many-one  reducible  to  2EC. 

Proof  of  Theorem  4.1:  For  any  Boolean  formula  (;^(p„_]  ,p„_2, . . .  ,Po), 
define  ~*^*’~**’\  ^i),  (xj"***,  62),  (xg,  63))  to  be  an  RCMS  of  depth 

2,  where: 


Cl 


62 

e.-j 


mu  )j{-K-x),  mu),  ■■■Jius)) 
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where  the  mapping  /  is  previously  defined,  and  the  formulas  are  defined 
by  Lemma  4.1. 

By  Lemma  4.1  and  our  semantics  defined  in  Section  2.4: 


*’2  O  I/(<^)1pi  O  |/(<^)Ip2  O  •  •  •  O  I/((^)Ip2"-l  O 


where  o  denotes  composition  and  pk,  k  =  6fn((a„_i,  a„_2, . . . ,  Oo)),  is  the 
well-formed  environment  defined  by: 


T  if  a,  =  1 
I  otherwise 


(0  <  2  <  72  —  1) 


By  Lemma  4.3,  if  <p  is  satisfiable,  then,  for  some  ^*  <  2",  [/(<?^)lpjt  =  T. 
(For  all  other  k  <  2",  lf{4>)}pk  =  !•)  This  will  mean  that  =  T. 

This  implies  that  I.r3Jp5<^  =  T. 

Also  by  Lemma  4.3,  if  (f)  is  not  satisfiable,  then,  for  all  k  <  2",  umpk  = 
I.  Since  ps^  must  specify  least-fixpoints,  this  means  that  |Ia:2~‘*’|ps^  =  2-2^2 
(i.e.  the  constant  function  ±  from  2  to  2).  This  implies  that  =  ±. 

Thus  <j>  €  GSAT  iff  e  2EC. 

We  have  shown  that  f{(f>)  takes  0(|<^|)  time  to  compute.  We  have  also 
shown  that  each  takes  polynomial  time  to  compute  and  has  0(72)  con¬ 
nectives.  Thus  the  entire  reduction  from  <i>  to  can  be  done  in  polynomial 
time.  □ 

We  can  now  conclude  that  the  2EC  evaluation  problem  is  A^P-hard. 

Example  4,3  Consider  the  propositional  calculus  formula  (j>  used  in  Exam¬ 
ple  ^.2,  where: 


4>  ::=  po  V  -‘{pi  A  -'P2) 
By  using  the  above  reduction,  we  construct: 

where  5^  is  an  RCMS  with  depth  2  and: 
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(/(<^^),  fh<f>l).  fi<l>",l  /(-<^?),  /(^o),  fh4)) 

62  ::=  x[‘^‘^®-f^■*‘’^(AI^3r^Ax^T^A3•^x^Ax^T^Ax^x^Aa:^T'’) 

es  ::=  x^'(±'’) 


where  mapping  f  is  previously  defined  and  formulas  (f>l  are  constructed  in 
Example  4-1-  Specifically: 


m 

f{4) 

f{-<i>l) 

/(^?) 

M) 

fi-4) 


o  n  o  /r') 

(2^^  0  z\-<>  o  2^-')  n  ((t/^^  n  yj-')  0  yl-'>) 
(yr'  n  y\-^  n  y^-”)  o  ((2^'  o  n  zt') 


5  Open  Problems 

In  this  paper,  we  have  presented  and  analyzed  the  2EC  evaluation  problem. 
We  have  concluded  that  2EC  (and  therefore  second-order  conjunctive  strict¬ 
ness  analysis)  is  A'P-hard.  This  result  obviously  leaves  many  open  problems 
left  to  examine.  There  are  two  very  important  open  questions  about  2EC 
worth  noting. 

5.1  Deterministic  Exponential  Time  Algorithm? 

Since  we  have  not  presented  an  algorithm  to  solve  2EC,  we  cannot  comment 
on  the  “upper  bound”  time  complexity  of  2EC.  Since  2E  can  be  solved  in 
“double-exponential”  time,  we  know  2EC  can  be,  too.  However,  it  seems 
possible  that  the  2EC  problem  can  be  solved  with  a  deterministic  exponential 
time  algorithm. 
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5.2  Deterministic  Exponential  Hard? 

The  idea  of  using  first-order  functions  to  represent  Boolean  values  suggests 
that  the  IE  problem  can  be  poly-time  many-one  reduced  to  2EC.  Specifically: 

•  We  ‘‘simulate”  the  value  J_  with  the  first-order  function  I. 

•  We  “simulate”  the  value  T  with  the  first-order  function  T. 

•  We  “simulate”  the  lub  operator  U  with  the  composition  of  tw’o  func¬ 
tions. 


Informally,  we  illustrate  the  above  idea  with  an  example.  Consider  the 
following  depth  1  RMS  S  -  ((xf  Ci),  (a;f  62),  (X3,  63)),  where: 


ei  ::= 

[j/l  L-l  (j/2  ^2  *(?/3’!/3))] 

e2  :;=  AyJyj. 

[y^Uxf^'’(yJUy^,y^,yJ)] 

63  ::=  xf^‘’(T',T'’) 

We  reduce  S  to  the  following  depth  2  RCMS: 

S'  =  el),  el).  (x‘-‘,  el),  (xj,  el)) 


where: 


e'  ::=  AyJ-*\*^*. 

[y^  o  o  y^^  y^^  y^^)] 


::=  X 


b—*b( 


'(Ax^T^Ax^x*’) 

‘3 

It  would  seem  that  [X3IP5  =  T  iff  fx^jps'  =  T. 
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Unfortunately,  this  straightforward  reduction  is  incorrect.  In  some  cases 
of  depth  1  RMS’s,  the  resulting  depth  2  ROMS  S'  will  not  “behave”  properly. 
Specifically,  the  problem  lies  within  the  specification  of  least-fixpoints  in  the 
environment  ps'.  Because  of  recursion,  ps>  sometimes  map  terms  to  the 
constant  ±  function  from  2  to  2,  when  we  would  like  the  terms  to  be  mapped 
to  either  I  or  T.  (In  fact,  in  the  proof  of  Theorem  4.1,  we  see  that 
maps  to  the  constant  ±  function  instead  of  I,  when  (j>  is  not  satisfiable.)  This 
will  cause  complications  in  the  reduction. 

For  example,  let  S  =  ((xf  “***, ei), ^2)),  where: 

ei  ::=  U  yj  U  xf  □  y^)] 

62  ::= 

Then  S'  =  ”*^*'~**’\  ^2)5  (^3)  ^3))  's  the  depth  2  ROMS 

defined  by  the  reduction,  where: 

el  ::=  Ayl-*‘y^\ 

::=  X2(±^) 

It  is  clear  that  Ix^ps  =  T.  However,  since  ps'  must  specify  least- 
fixpoints: 

This  means  that  Ixyp5<  =  ±.  Thus  the  above  reduction  does  not  hold. 

It  is  the  opinion  of  the  author  that  such  problems  may  be  correctable, 
but  more  work  must  be  done  in  order  to  do  so. 
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